N=1:

rank | frequency | n-gram
---|---|---
1 | 2432 | -а
2 | 2270 | -н
3 | 1984 | -ы
4 | 1340 | -р
5 | 1281 | -ә

N=2:

rank | frequency | n-gram
---|---|---
1 | 833 | -ан
2 | 663 | -ың
3 | 622 | -ар
4 | 516 | -ән
5 | 486 | -ға

N=3:

rank | frequency | n-gram
---|---|---
1 | 317 | -нда
2 | 285 | -ның
3 | 277 | -ары
4 | 256 | -ына
5 | 206 | -ған

N=4:

rank | frequency | n-gram
---|---|---
1 | 254 | -ында
2 | 159 | -ының
3 | 153 | -ендә
4 | 127 | -рҙың
5 | 121 | -алар

N=5:

rank | frequency | n-gram
---|---|---
1 | 109 | -арҙың
2 | 68 | -ндағы
3 | 58 | -рында
4 | 56 | -һында
5 | 51 | -арына

The tables show the most frequent letter N-grams at the end of words for N=1…5. The procedure parallels 2.2.5 Most frequent word beginnings; the aim here is suffix detection rather than prefix detection.
For N=3, the query is:
SELECT @pos := (@pos + 1) AS pos, xx.* FROM (SELECT @pos := 0) r, (SELECT COUNT(*) AS cnt, CONCAT('-', RIGHT(word, 3)) AS ngram FROM words WHERE w_id > 100 GROUP BY RIGHT(word, 3) ORDER BY cnt DESC) xx LIMIT 5;
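The counting step behind the query can also be sketched outside the database. The following Python snippet is a minimal illustration, not the pipeline used for the tables: the function name `top_endings` and the toy word list are hypothetical, and it skips words shorter than N, whereas `RIGHT(word, N)` in SQL would return such a word whole.

```python
from collections import Counter


def top_endings(words, n, limit=5):
    """Return (rank, frequency, n-gram) rows for the most frequent n-letter endings."""
    # Count the last n characters of each word. Words shorter than n are
    # skipped (an assumption of this sketch; SQL RIGHT() would keep them whole).
    counts = Counter(w[-n:] for w in words if len(w) >= n)
    # most_common() orders by descending frequency, like ORDER BY cnt DESC.
    return [
        (rank, freq, "-" + gram)
        for rank, (gram, freq) in enumerate(counts.most_common(limit), start=1)
    ]


# Hypothetical toy input, not taken from the corpus.
sample_words = ["барында", "ерында", "урында", "китап", "китаптар", "балалар"]
for row in top_endings(sample_words, 3):
    print(row)
```

Calling `top_endings(words, n)` for n from 1 to 5 would yield one table per N, in the same rank, frequency, n-gram layout as above.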